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So far, a very large amount of work in Natural Language Processing (NLP) rely on trees as the core 
mathematical structure to represent linguistic informations (e.g. in Chomsky's work). However, 
some linguistic phenomena do not cope properly with trees. In a former paper, we showed the 
benefit of encoding linguistic structures by graphs and of using graph rewriting rules to compute on 
those structures. Justified by some linguistic considerations, graph rewriting is characterized by two 
features: first, there is no node creation along computations and second, there are non-local edge 
modifications. Under these hypotheses, we show that uniform termination is undecidable and that 
non-uniform termination is decidable. We describe two termination techniques based on weights and 
we give complexity bound on the derivation length for these rewriting systems. 



1 Introduction 
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Linguists introduce different levels to describe a natural language sentence. Starting from a sentence 
given as a sequence of sounds or as a sequence of words; among the linguistic levels, two are deeply 
considered in literature: the syntactic level (a grammatical analysis of the sentence) and the semantic 
level (a representation of the meaning of the sentence). These two representations involve mathematical 
structures such as logical formulae, A -terms, trees and graphs. 

One of the usual ways to describe syntax is to use the 
notion of dependency |[T6l . A dependency structure is an ^^^^p 
ordered sequence of words, together with some relations 
between these words. For instance, the sentence "/ see 
that Mike begins to work" can be represented by the struc- 
ture on the right. 

There is a large debate in the literature about the mathematical nature of the structures needed for 
natural language syntax: do we have to consider trees or graphs? Trees are often considered for their 
simplicity; however, it is clearly insufficient. Let us illustrate the limitations of tree-representations with 
some linguistic examples. Consider the sentence "Bill expects Mary to come" , the node "Mary" is 
shared, being the subject of "come" and the object of "expects" (below on the left). The situation can 
be even worse: cycles may appear such as in the sentence below where edges in the cycle are drawn with 
dashed line (below on the right). 
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For the semantic representation of 
natural language sentences, first order 
logic formulae are widely used. To 
deal with natural language ambiguity, 
a more compact representation of a set 
of logic formulae (called underspeci- 
fied semantic representation) is used. 
DMRS [4l is one of these compact rep- 
resentation. The DMRS structure for 
the sentence "The Dog whose toy the 
cat bit barked" is given in the figure on 




the right. 

To describe transformations between syntactic and semantics structures, there are solutions based on 
many computational models (finite state automata, A -calculus). It is somewhat surprising that Graph 
Rewrite Systems (GRS) have been hardly considered so far (lUl [T] |5l |9|). To explain that, GRS im- 
plementations are usually considered to be too inefficient to justify their extra-generality. For instance, 
pattern matching does not take linear time where this is usually seen as an upper limit for fast treatment. 

However, if one drops for a while the issue of efficiency, the use of GRS is promising. Indeed, 
linguistic considerations can be most of the time expressed by some relations between a few words. 
Thus, they are easily translated into rules. To illustrate this point, in ||3l[T3, we proposed a syntax to 
semantics translator based on GRSs: given the syntax of a sentence, it outputs the different meaning 
associated to this syntax. 

In the two earlier mentioned studies, we tried to delineate what are the key features of graph rewriting 
in the context of NLR Roughly speaking, node creation are strictly restricted, edges may be shifted from 
one node to another and there is a need for negative patterns. Based on this analysis, we define here a 
suitable framework for NLP (see Section |3]l. 

Compared to term rewriting, the semantics of graph rewriting is problematic: different choices can 
be made in the way the context is glued to the rule application ifTSl . As far as we see, our notion does 
not fit properly the DPO approach due to unguarded node deletion nor the SPO approach due to the shift 
command, as we shall see. Thus we will provide a complete description of our notion. We have chosen 
to present it in an operational way and we leave for future work a categorial semantics. 

In our application, we use several hundreds of rules. To manage such a system, we use a notion of 
modular graph rewriting system: the full set of rules is divided in smaller subsets (called modules) that 
are used in turn. 

In practice, we need some tools to verify termination and confluence properties of modules. In 
Section [4j we provide two termination methods based on a weight analysis. First, there is a direct moti- 
vation: in our NLP application, any computations should terminate. If it is not the case, it means that the 
rules where not correctly drawn. Then, termination ensures partly the correctness of the transformation. 
There is also an indirect reason to consider termination: one way of establishing confluence is through 
Newman's Lemma [ 1 1] which requires termination. 

We consider two properties of the above mentioned termination methods. First, we show that they 
are decidable, that is the existence of weights can be computed statically from the rules, and thus we have 
a fully automatic tool to verify termination. Obviously, it is not complete. In a second step, we evaluate 
the strength of the two methods. To do that, we consider what restrictions they impose on the length of 
computations. We get quadratic time for the first method, polynomial time for the second. This article is 
an extended abstract of IS • 
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2 Linguistic motivations 

Without any linguistic exhaustivity, we highlight in this section some crucial points of the kind of linguis- 
tic transformation we are interested in and hence the relative features of rewriting we have to consider. 



Node preservation property. As linguistic examples above suggest, the goal of linguistic analysis is 
mainly to describe different kinds of relations between elements that are present in the input structure. 
As a consequence, the set of nodes in the output structure is directly predictable from the input and only 
a very restrictive notion on node creation is needed. In practice, these node creations can be anticipated 
in some enriched input structure on which the whole transformation can be described as a non-size 
increasing process. 
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Edge shifting. In the first example of the introduction (for 
the sentence "/ see that Mike begins to work"), the verb 
"begins " is called a raising verb and we know that "Mike " 
is the deep subject of the verb "work"; "begins" being con- 
sidered as a modifier of the verb. To recover this deep subject, one may imagine a local transformation 
of the graph which turns the first graph on the right into the second one. 

However, in our example above, a direct application of such a transformation leads to the structure 
below on the left which is not the right structure. Indeed, the transformation should shift what the 
linguists call the head of the phrase "Mike begins to work" from the word "begins" to the word "work" 
with all relative edges. In that case, the transformation should produce the structure below on the right: 
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In a more general setting, our transformations may have to specify the fact that all incident edges of 
some node X must be transported to some other node Y . We call this operation shift. 

To describe our graph rewriting rules, we introduce a system of commands (like in f6l) which ex- 
presses step by step the modifications applied on the input graph. The transformation described above is 
performed in our setting as follows: 



r 
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del_edge (Y,SUBJ,X); del_edge (Y,COMP,T); 
add.edge (T,SUBJ,X); add.edge (T,MOD,Y); 
shift (Y,T) 



Negative conditions. In some situation, rules must be aware of the context of the pattern to avoid 
unwanted ambiguities. When computing semantics out of syntax, one has to deal with passive sentence; 
the two sentences below show that the agent is optional. 

SUBJ SUBJ 

DET f AUX ] PET f AUX ] AGT OBJ 

i ll I ll i, ll i ill ll ; 

The banana was eaten The banana was eaten by Mike 
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In order to switch to the corresponding active form, two different linguistic transformations have to 
be defined for these two sentence; but, clearly, the first graph is a subgraph of the second one. We don't 
want the transformation for the short passive on the left to apply on the long passive on the right, we 
need to express a negative condition like "there is no out edge labeled by AGT out of the main verb" to 
prevent the unwanted transformation to occur. 
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Long distance dependencies. Most of the lin- 
guistic transformation can be expressed with suc- 
cessive local transformation like the one above. -r^^ ,.,^,,,0,, o^^rv,o 1^,,^ 

The woman whom John seems to love 

Nevertheless, there are some cases where more 

global rewriting is required; consider the sentence "The women whom John seems to love", for which 
we consider the syntactic structure on the right. One of the steps in the semantic construction of this 
sentence requires to compute the antecedent of the relative pronoun "whom " (the noun "woman " in our 
example). 

The subgraph we have to search in our graph (which ^ 2§J ^^(OBJicoMPr ^ mod_rel ^ 

is depicted as a non-local pattern) and the graph modifi- x Y z T 

cation to perform are given on the right. The number of PRO_REL 
OBJ or COMP relations to consider (in the relation de- add_edge (X,ANT,T) 

picted as (OBJ|COMP)* in the figure) is unbounded (in 

linguistics, this phenomenon is called long distance dependencies); it is possible to construct grammati- 
cal sentences with an arbitrary large number of relations. 

As we want to stay in the well-known framework of local rewriting, we will use several local trans- 
formations to implement such a non-local rule. 

OBJ TMP OBJ TMP COMP TMP M0D_REL 

i 1 I M, I I U I I U I 

X yXYZXYZXZT 
PRO_REL 

del_edge (X,TMP,Y) del_edge (X,TMP,Y) del_edge (X,TMP,Z) 
add_edge (X.TMP.Y) add_edge (X,TMP,Z) add_edge (X,TMP,Z) add_edge (X,ANT,T) 

The second and the third rules above preserve the set of nodes and the number of edges of each kind. 
Hence, this kind of rule will require special treatment with respect to termination issues. 



3 Graph Rewriting for NLP 

Before we enter into the technical sections, let us define some useful notations. First, we use the notation 
c to denote sequences. The empty sequence is written 0. The length of a sequence is denoted by \c\. We 
use the same notation for sets: the empty set is denoted and the cardinality of a set S is written l^l. The 
context will make clear whether we are talking about sequences or sets. 

Given a function / : X ^ F and some sets X' C X and Y' C Y, we define f{X') = {/(x) \xeX'} and 
/-I (F') ^{x(eX\ f {x) G Y'}; the restriction of the function / to the domain X' is f\^, :x'€X'^ f{x'). 
The function cx:xgXi— ^cGFis the constant function on X. The identity function is written 1. Finally, 
given a function / : X — >■ 7 and {x,y) G X x 7, the function f[x i-)- _y] maps t ^xio f{t) and x to y. 

The set of natural numbers is N, integers are denoted by Z. Given two integers a^b, we define 
[a,b] = {xe'L\a<x<b}. 
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3.1 Graphs 

The graphs we consider are directed graphs with both labels on nodes and labels on edges. We restrict 
the edge set: given some edge label e, there is at most one edge labeled e between two given nodes a 
and j3 . This restriction reflects the fact that, in NLP application, our edges are used to encode linguistic 
information which are relations. We make no other explicit hypothesis on graphs: in particular, graphs 
may be disconnected, or have loops. 

In this paper, we suppose given a finite set of edge labels and another finite set of node labels. 

Definition 3.1 (Graph). A graph G is defined as a triple S) where 

• is a finite set of nodes; 

• ^ is a labeling fiinction: £ : i— )• L^; 

• S' is a set of edges: (f C ^ x x 

Let n,m € and e G Eg. When there is an edge from n to m labelled e {i.e. {n,e,m) £ S), we write 
n — > m or n — > m if the edge label is not relevant. If G denotes some graph {,J^ ,£,(d), then .J/g^ 
(aQ denote respectively ^ ,1 and S. 

Definition 3.2 (Graph morphism). A graph morphism p, from the graph G = {.yV ,£,S) to the graph 
G' = {,yy',i', S") is a function from .jV to .JV' such that: 

• for all n(^ £' {\l(n)) = l(n); 

• for all n,m £ ^yV and e G Le, ifn — ^ m £ (o then p{n) — ^ M('m) G S". 

A graph morphism p is said to be injective ;/ p{n) = p{m) implies n = m. We make the follow- 
ing abuse of notation: given some graph morphism p : G ^ G', and a set E Q Sq, we let p{E) = 
{p{n) p{m) I n m G E}. 

Definition 3.3 (Basic pattern and basic matching). A basic pattern B is a graph. A basic matching p of 
the basic pattern B in the graph G is an injective graph morphism p (written p . B ^ G). 

As shown in Section |2j negative conditions on patterns naturally arise in NLP. We classify negative 
conditions in two categories: the local ones, that is negative conditions on edges within the basic pattern 
and non-local ones, that is negative conditions concerning edges between a node of the basic pattern and 
a node of the context (either in-edges or out-edges). 

Definition 3.4 (Pattern). A pattern is a quadruple P = {B, S", J^, G) of: 

• a basic pattern B = {^p,ip, Sp); 

• a set of forbidden edges § C .jVp x x .jVp such that S 'r\Sp = 

• a set of forbidden in-edges J' C .jVp x 

• a set of forbidden out-edges G C jVp x 

Given a basic pattern B, we shorten (B, 0,0,0) to (B,0). In the following, given a pattern P, ,jVp and 
Sp denote respectively the set of nodes of its basic pattern and the set of edges of its basic pattern. 

Definition 3.5 (Matching). Let P = {B, J^, G) be a pattern, G = S") be a graph, and p.B^G 

be a basic matching. We say that p is a matching /rom P into G (also written p : P ^ G) whenever it 
satisfies the additional three conditions: 

• p{<^)r\^ = (l) 
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• for each {n,e) £ y, {p £ ^\n{J/p) \ p = 

• for each {n,e) £ ^, {p e J^\}x{J^p) \ }x{n) p}=% 

Example 3.1. Negative conditions are used to remove 'unwanted' matching. To see their effect, consider 
for instance the basic pattern Bq and its two basic matchings }X\ and }X2 in Go: 




• First, let Pq = {Bq,(1>). Then, /li and II2 cif^ (the) two matchings Pq Gq. 

• Second, let the pattern Pi = (So,{(Z7i,C,Z7o)},0,0); then, /li is the only matching Pi ^ Gq. 

• Third, let the pattern Pj = {BQ,(Z),{{bQ,D)},{{bQ,D)}). Then, there is no matching ofPj into Gq. 

In the following, patterns Pi and P2 are depicted as: 




3.2 Graph decomposition 

The proper description of actions of a rule on some graph G requires first the definition of two partitions: 
one on nodes and the other on edges. They are both induced by the matching of some pattern P into G. 

Definition 3.6 (Nodes decomposition: pattern image, crown and context). Let pL : P ^ G a matching 
from the pattern P into the graph G = Nodes of G can be split in a partition of three sets 

^ = ^lx®-J^^®%: 

• the pattern image is S^p, = /i(^); 

• the crown contains nodes outside the pattern image which are directly connected to the pattern 
image: ^ = {n £ ^\ \ 3p G such that n — > p or p — > n}; 

• the context contains nodes not linked to the pattern image: 'if^ = JV \ UJ^). 

Definition 3.7 (Edges decomposition: pattern edges, crown edges, context edges and pattern-glued 
edges). Let \i:P ^ G a matching from the pattern P into the graph G = {,yy,i,(o). Edges ofG can be 
split in a partition of four sets S = lx{Sp) Jf^ ®'t^^ ® J^^: 

• the pattern edges is pi{<^p); 

• the crown edges set contains edges which links a pattern image node to a crown node: J{f^ = 
{n — >m£<^\n£ ^^^m£ J^}U{?i — >m£ | « G Am G ^^}; 

• the context edges set contains edges which connect two nodes that are not in the pattern image: 

={n — >m£^\n^ &>y,Nm^ ^^}. 

• the pattern-glued edges set contains edges which are not pattern edges but which connect two 
nodes that are in the pattern image: M'^ = {n — > m £ \ n £ Am G ^fi} \lJ.{^p). 
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3.3 Rules 

In our graph rewriting framework, the transformation of the graph is described through some atomic com- 
mands (like in 10). Commands definition refer to some pattern P and pattern nodes are used as iden- 
tifiers. Leta,b S ^p, a and e € S^, the five kinds of commands are label(fl;, a), del_edge{a,e,b), 
a.dd_edge{a,e,b), deljiode(a) and shift(fl,^). 

Their names speak for themselves, however, we will come back to their precise meaning in the 
subsection below. Before this, to ensure that commands always refer to valid node identifiers, we restrict 
command sequences to consistent sequences, that is sequences ci, . . . ,Cyt such that for each command c,-, 
I <i <k, which is a node deletion command del_node(a) for some a G ^p, then the node name a does 
not occur in any command cj with i < j < k. 

Definition 3.8 (Rule). A rule R is a pair R = {P,c) of a pattern P and a sequence of commands c 
consistent with respect to P. A rule R = {P,c) is said to be node-preserving ifc does not contain any 
del_node command. 

3.4 Graph Rewrite System 

Let G = (f ) a graph, R = {P,c) a rule and p. : P ^ G a. matching. The application of the sequence 

c on G is a new graph which is written G-^c (shortened G ■ c when p is clear from the context) and is 
defined by induction on the length kofc.lfk = 0,G-® = G.lfk>0, let G' = {^',1', S") be the graph 
obtained by application of the sequence ci , . . . , c^^ i ; then we consider each command in turn: 

Label: The command Ck = label(a, a) changes the label of the node p{a): G -c = {^',£", S") with 
£" = e'[p{a)^a]. 

Delete: The command q = del_edge{a,e,b) deletes the edge from p{a) to p{b) labelled with e S E^; 
G-c = with f^" = ^' \{p{a) ^ p{b)}. 

Add: The command q = cLdd_edge{a,e,b) adds an edge from p{a) to p{b) labelled with e E Z^; 
G-c = with ^" = ^' U{pia) ^ p{b)}. 

Delete node: The command Ck = deljiode(a) removes the node p{a) oiG';G-c= ,i" ,£"') with 
= \ {p{a)}, r = ey^„ and (f" = S'r\{,y^" xLeX ^"}. 

Shift edges: The command Ck = shift {a, b) changes in-edges of p{a) starting from the crown to in- 
edges of p{b) and all out-edges of p{a) going to the crown to out-edges of p{b). Formally, 
G-c = with the set ^" defined by, for all e G If: 

• for all p e Jf^, p{b) ^ p e S" ifi p{b) ^ p e S' or p{a) ^ p e S'; 

• for all p G 7? ^ p{b) G iff p ^ p{h) ^ S' or p ^ p{a) ^ S'; 

• for all G p ^ <? G (f" iff p ^ <7 G S'\ 

• for all G ^ U<r^, ^ ^ G (f" iff p ^ ^ G S' . 

The commands label, del_edge and add_edge are called local commands: they modify only the 
edges and the nodes described in the pattern. The commands deljnode and shift are non-local: they 
can modify edges outside the pattern. Note that a rule add_edge (resp. del_edge) may have no effect 
if the edge already exists (resp. does not exist). Note also that we can suppose that for a given sequence 
c and a given triple {a,e,b), there is at most one rule del.edge{a,e,b) or add_edge(c?,e,Z?) in c (if not, 
only the last one is effective). Hence, we can define uniform rules: 
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Definition 3.9 (Uniform rule). For c = c\, . . . ,Ck without node deletion, the rule {P,c) is uniform iff for all 

\ <i <k, ifcj = add_edge(?i,e,m) then {n,e,nt) G S'p andifcj = del_edge(?i,e,m) then (n,e,m) G (fp. 

Definition 3.10 (Rewrite step). Let G = (=yK,£,(f ) a graph, R = {P,c) a rule and n : P ^ G a matching. 
Let G' = G c, then we say that G rewrites to G' with respect to the rule R and the matching }X. We write 
it G — G' or G — G' or even simply G — )• G' . 

Definition 3.11 (Graph Rewrite System). A Graph Rewrite System is a finite set of rules. 

In our application, the translation of the syntax to semantics is split into several independent levels 
of transformation driven by linguistic consideration (such as translation of passive forms to active ones, 
computation of the deep subject of infinites). Rules are then grouped in subsets called modules and 
modules apply sequentially; each module being used as a graph rewrite system on the outputs of the 
previous module. 

Lemma 3.1 (Linear modification). Given a GRS ^ , there is a constant C > such that, for any rewriting 
step G -^R,n G' the two canonical corresponding edge decompositions S'g = 'rf^ © and ^Qt = © 
^'^^ satisfy: 

\^^\<Cx {\G\ + 1) and |^'^| < C x + 1) 

Proof. Let C = max{2 x |Pp x \Y.e\] \ {P,c) G ^}. Both in G and G', edges that are not in the context are 
either between two pattern nodes or between a pattern node and a crown node. The total number of edges 
of the first kind (either pattern edges or glued-pattern edges) is bounded hy \P\ x IZ^I. For each pattern 
node, the number of edges which connect this node to some non-pattern node is bounded by 2 x | G| x | 
and so the total number of edges which link some pattern node to some non-pattern node is bounded by 
2 X |G| X ll^l X \P\. Putting everything together, \^''\<Cx {\G\ + 1) and < C x (|G| + 1). □ 

4 Weighted GRS 

We recall that a GRS is said to be (strongly) terminating whenever there is no infinite sequence Gi — 
G2 . Given a terminating GRS and a graph G, we define the derivation height of G, next denoted 
hc^{G), to be the length of the longest derivation starting from G if such a derivation exists. If hcg{G) is 
defined for all G such that \G\ <n, then we define the derivation height of by: = max{/ig?(G) | 

|G| <«}. 

Actually, for non-size increasing GRS as presented above, we have immediately the decidability 
of non-uniform termination. That is, given some GRS and some graph G, one may decide whether 
there is an infinite sequence Gi G2 —)• G3 —)•••• . Indeed, one may observe that for such sequence, 
for all / G N, < Thus, the G,'s range in the finite set <B<|g| of graphs of size less or equal to 
|G|. Consequently, either the system terminates or there is some j < \(5<\g\ \ and some k < j such that 
Gj = Gk. To conclude, to decide non-uniform termination, it is sufficient to compute all the (finitely 
many) possibilities of rewriting G in less than |<B<|g| | steps and to verify the existence of such a j and k 
above. Finally, since |<5<|g| | < 2^(1*^1^', the procedure as described above takes exponential time. 

However, uniform termination — given a GRS, is it terminating? — of non-size increasing GRS re- 
mains an open problem. Uniform termination was proved undecidable when we drop the property of 
non-size increasingness (cf. Plump [14J). As a consequence, there is a need to define some termina- 
tion method pertaining to non-size increasing GRS. Compared to standard work in termination ifTBl lTll. 
there are two difficulties: first, our graphs may be cycUc, thus forbidding methods developed for DAGs 
such as term-graphs. Second, using term rewriting terminology, our method should operate for some 
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non-simplifying GRS, that is GRS for which the output may be "bigger" than the input. Indeed, the NLP 
programmer sometimes wants to compute some new relations, so that the input graph is a strict sub-graph 
of the resulting graph. 

4.1 Termination by weight analysis 

In the context of term-rewriting systems, the use of weights is very common to prove termination. There 
are many examples of such orderings, Knuth-Bendix Ordering lITOll to cite one of them. We recall that 
all graphs we consider are defined relatively to two signatures of edge labels and of node labels. 

Definition 4.1 (Edge weight, node weight). An edge weight is a mapping w : — )• Z. Given some 
subset E of edges of G, the weight of E is w{E) = L„_i^me£:^(^-'' edge weight of a graph G is 
w{G) = w{i§q). a node weight is a mapping r\ : Z^v — ^ For a graph G = {^g^^Gi'^g)' we define 
r]{G) =Zne.A'aWG{n)). 

Let us make some observations. Let \G\^ denote the number of edges in G which have the label e, 
then w{G) = LeeZf w{e) x |G|g. Second, for a pattern matching ji -.P^G, w{pL{P)) = w(P). 

The weight of a graph may be negative. This is not standard, but it is useful here to cope with non- 
simplifying rules, that is rules which add new edges. Since a graph G has at most |r£-| x |G| edges, the 
following lemma is immediate. 

Lemma 4.1. Given an edge weight w and a node weight r\, Zef = maxg(=5;^(|w(e)|), Ke = \Le\ x K^, 
Kjj =max„ei:A,(|T](a)|), then 

(a) for each subset of edges E d Sg of some graph G, we have w{E) < x j^l. 

(b) for each graph G, we have —Ke x |Gp < w{G) <KeX \G\^; 

(c) for each graph G, we have < ^ij x \G\. 

Definition 4.2. Let R = {P,c) a rule, we define inductively : jVp — t- jVp which describes the global 
effect of the shift commands in a rule: = 1; *J*?.shift(m,n) = 1 ['w i— t- n] o <I>j and = ifc is not 
a shift command. 

Definition 4.3 (Compatible weight). Given a rule R = ((P, (f , J^, ff),c), an edge weight w is said to be 
compatible with R if: 

1. either c contains a del_node command 

2. or R is a node-preserving rule and satisfy the three properties: 

(a) R is uniform, 

(b) w{P-ic)<w{P), 

(c) for all e £Le such that w{e) < 0, for all n G <t>{^p), let Mn C be the set <t>Z^ (n); then Mn 
contains at most one element m such that {m,e) and M„ contains at most one element 
m' G Mn such that {m' ,e) ^. 

An edge weight is said to be compatible with a GRS ^ if it is compatible with all its rules. A weighted 
GRS is a pair (^^,h') of a GRS and a compatible weight. 

Hypothesis (2.b) will serve to manage edges in the pattern images while Hypothesis (2.c) will serve 
for the crown edges. One may note that when there is no shift commands in the rule, the Hypothesis 
(2.c) holds whatever w is. Indeed, in that case, <t> is the identity function and all the sets M„ are singletons. 



G. Bonfante and B. Guillaume 



13 



Lemma 4.2. Let{W,w)a weighted GRS, letG^G' tea rewrite step ofW. Either | G | > | G' | or | G | = | G' | 
and w{G) > w{G'). 

The problem of the synthesis is the following. Given a GRS W, is there a weight w compatible with 
1^? Since the existence of weights can be described in Presburger's arithmetic, we have a positive answer: 

Theorem 4.1. Given a GRS 1^, one may decide whether or not it has a compatible weight. 

Second point, the existence of weights induce termination: 

Theorem 4.2. Any weighted GRS is strongly terminating in quadratic time. Moreover, this 

quadratic bound is a lower bound: there is a GRS ^ with a compatible weight such that hcg{n) > 0{n^). 



Condition (2.c) of Definition 4.3 is necessary. Here is a counter-example of a non-terminating system 
with a compatible weight up to this condition. Consider the two rules {Qi , ci ) and {Q2,C2 ) : 

C 




del_edge(0,A,l) 
shift(0,l) 




add_edge(3,A,4) 
add_edge(5,C, 3) 



Set w(A) = w(B) = 1 and w{C) = -2. Observe that w{Qi ■ci) = \<2 = w{Qi) and w{Q2 ■ C2) = -2 < 
— 1 =w{Q2). However, there is an infinite sequence Gi -^R^ G2 —i^ftj Gi • with Gi and G2 being: 



G2 





Proof sketch of Theorem 4.2 We begin to show the lower bound. Let = {£"}, Z/v = {^}- Consider 



the two rules GRS defined by the two basic patterns: 

0:e j ^ ( l:e j del_edge(0,£, 1) E Of 2:e) del_edge(2,£,2) 




Set w{E) = 1. The rules are compatible with w. Each rule deleting exactly one edge, since the clique 
Cn of size n has n^ edges, the derivation height h<^{C„) = n^. The lower bound follows. 

For the upper bound, let C be the constant as defined by Lemma [TT] let K = m&x{\,K„) (we recall 
that/^„, = maxee2:£(|w(£>)|)). Finally, let // = max{« | (^,01, . . . ,c„) G?^}. LetA = 2xi^xCx {H + \) + 
1. Let Q. be the 'energy function' defined on graphs Q.{G) = w{G) +A x |Gp. For each rule application 



G — G', one may verify that ^1(G) > D.{G'). The last inequality together with Lemma 4.1 leads to the 



conclusion. □ 
Full proofs of Theorem 4. 1 and of Theorem 4.2 are given in ||2l. 



4.2 Termination by lexicographic weight 

In our experiments, in most cases, the weight analysis of the preceding section was sufficient. The main 
counter-example is however systems composed of rules as given in Section [2] The GRS is strongly 
terminating but there is no compatible weight. This section provides a conciliable extension of this ter- 
mination proof method. With a little abstraction, the linguistic example of Section [2] about long distance 
dependencies is computed by some 'non-local rule' /?„/: 
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Rni — 




A 




add_edge (ho,A,b''2) 



Such non-local rules can be implemented by rules: 



INIT 



Rec 



Stop 



Clean 




label(i)o,-P<>) 
add_edge {bo,E,bi) 





add_edge {bo,E,b2) 



add_edge(/7(),A,&2) 
label(/;o,P) 




Figure 1 : Local implementation of the non-local rule 



However, these rules are not compatible with any weight. Actually, as justified in 121, there is no imple- 
mentation of such a rule by some weighted rules. 

Given an order -< on some set U , its lexicographic extension to sequences in U is defined by 
. . .,uji) -<iex (vi, . . . ,Vm) iff 3j < min(m,^) : Uj -< vj A V/ < j : m,- = v,-. The order -<iex is not well- 
founded in general, but its restriction to sequences of equal length is such as soon as -< is well-founded. 

Definition 4.4 (Contextual weight). An edge contextual weight is a (finite) map ft) : Zyy x x Z^y — ^ ^■ 

As for weights, it extends to any set E C Sq of some graph G by: CO{E) = L„_^„,££ CO{i{n),e,£{m)). And 
the weight of a graph is (0{G) = (o{S'g). 

A contextual weight is a 4-tuple n = {a,(0,b,ri) with a,b CO an edge contextual weight and r\ a 
node weight. We define 7i{G) =ax (o{G) + bxr\ (G). 

Let e € T^, if a ^ and there are Of, j8, o;', j3' G T,n such that (o{a,e,l5) ^ (o{a' ,e,p'), then we say 
that 71 is e-fragile. 

Definition 4.5. Given an edge weight wq : — >■ given k contextual weights Ki,. . . ,Kic and a rule 
R = {P,c), we write P' = P-ic. We say that R is compatible with (wq, K\,...,Kk) iff: 

1. either c contains a del_node command, 

2. or R is an uniform and node-preserving rule such that: 

(a) either the two properties below hold 

(i) wo{P') < wo{P); 

( ii) and for all e G such that w{e) < 0, for all n € <t>{^Ap), let M„ be the set <I>~^ (n); then 
Mn contains at most one element m such that {m,e) andMn contains at most one 
element m' G M„ such that {q,m') ff. 

(b) or the four properties below hold 

(i) wo{P')=wo{P); 

(ii) {71, [P'),..., 7lk{P')) <le. (TTl (/'),..., nu{P)); 
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(Hi) if c contains a command \zhel{n, a) and if some Hi is e-fragile, then {n,e) GJ^Uff; 
(iv) c does not contain any shift commands. 

When a weight wq and k contextual weights are compatible with all the rules of some GRS we 
say that §f is lexicographically weighted by (wq, tti , . . . , 7iic). 

Example 4.1. We define wq = ©^^[A i-^ -1], and CO = Oznx-LexZnKP^^ ^ ^7{Po,E,X) i-> -1]. Con- 
sider the lexicographic weight n = [1,(0, 0,0^^)- For rules in Figure^ we have: rule Stop decreases 
by (2. a); rules INIT and Rec decrease by (2.b): there is one more edge labeled E starting from Po and 
rule Clean decreases by (2.b): one edge labeled E starting from P disappears. 

Theorem 4.3. Whenever a program ^ is compatible with the lexicographic weight (wq, TCi,.. . , TC^), it is 
strongly terminating in polynomial time. The bound is tight, that is for all k > 0, there is a GRS whose 
derivation height is O^n'^). 

Proof. Examples for the lower bound are proposed in [2|. For the upper bound, let 

K(o = max{|G)(«,e,/M)| I {n,e,m) G Eat x x £/v} and Kj^ = ax \Le\ x Kg) +b x Krj 



Then, ad aptin g Lemma 4.1 b]l to the present context, we can state that |ft)(G)| < K(q x IZ^I x \G\ 



4.1 

7 



2 



c|), we have |t](G)| <KrjX \G\ and finally |;r(G)| <ax K^x |££| x\G\ +b x Kj^ 



With Lemma 
\G\ <K„x\G 

Let ^0 = msiXj^[ij^]{Kji.). Finally, let Ke be the constant as given by Lemma 4.1 for wq, we define 
K = m£Lx{Ko,KE). Then, for all / < k, we have: |7r;(G)| <Kx \G\^ and |wo(G)| <Kx \G\^-. 

Let k{G) = {\G\,WQ{G),Ki{G),...,nk{G)). If G ^ G', then k{G) > k{G'). Consider a sequence 
Gi — G2 — 5- • • • . For all graph G,- of the sequence, |G,| < |Gi|. Due to previous equations, K'(G;) is 
ranging in L= [0,1 Gi|] x x |Gi p,/*: x |Gi Thus the result. □ 



5 Conclusion 

The polynomial derivation height that we have proved in the last section can be reconsidered in the 
following way. The example of a GRS working in C?(«*^) can be used as a clock. Then, since each 
transition of a (non-size increasing) Turing-Machine can be easily simulated by graph rewriting, we can 
state that any PTiME-predicate can be simulated by a lexicographically weighted GRS (up to a polynomial 
reduction). Since lexicographically weighted confluent GRS can be computed in polynomial time (each 
rewriting step can be simulated in linear time), it becomes clear that lexicographically weighted GRSs 
actually characterize PTIME. This provides a precise description of the computational content of the 
method. 

We have implemented a software — called GREW ([grew, loria.fr I — based on the Graph Rewriting 



definition presented in this article. In [12], the software was used to produce a semantically annotated 
version of the French Treebank; in this experiment, the system contains 34 modules and 571 rules and 
the corpus is constituted of 12 000 sentences of length up to 100 words. This experiment is a large scale 
application which shows that the proposed approach can be used in real-size applications. 

As said earlier, despite the global non-confluence of the system, we can isolate subsets of rules that 
are confluent and use our system of modules to benefit from this confluence in implementation. In our 
last experiment, 26 of our 34 modules are confluent, but confluence proofs are tedious. We leave for 
further work the study of the local confluence of terminating GRS and the general study of confluence of 
Graph Rewriting Systems. 
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